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ENTERED 



3 <110> APPLICANT: E. I. du Pont de Nemours and Company 
5 <120> TITLE OF INVENTION: Flavonoid Biosynthetic Enzymes 
7 <130> FILE REFERENCE: BB1324 

9 <140> CURRENT APPLICATION NUMBER: 09/868, 546A 
10 <141> CURRENT FILING DATE: 2001-09-20 

12 <150> PRIOR APPLICATION NUMBER: 60/113,190 

13 <151> PRIOR FILING DATE: 1998-12-21 
15 <160> NUMBER OF SEQ ID NOS : 6 
17 <170> SOFTWARE: Microsoft Office 97 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 1859 

21 <212> TYPE: DNA 

22 <213> ORGANISM: Glycine max 

24 <400> SEQUENCE: 1 

25 gaaaacactg acagacagca tagtctctgg tgcaagaatc aattgagcaa gcatgggaat 60 

26 gttgttggtg gtggtctcat acgctgtcct tttcctggtt ctattcctcg gcgtgaagtt 120 

27 tgttttccaa agcagaaaat tgagaaacat accaccaggt cctcctcctc ttcccataat 180 

28 aggaaacctt aacctcctcg aacagccaat ccaccgtttc ttccaacgca tgtcgaaaca 240 
2 9 gtacggcaac gtggtttccc tctggttcgg ttcacgtctg gccgttgtca tctcctctcc 300 

30 aacagcatac caagaatgct tcaccaaaca cgacgttgcc ttggccaacc ggctaccttc 360 

31 tctctcggga aaatacatct tctacaacaa caccaccgta ggctcctgct cccacggcga 4 20 

32 gcactggcgc aacctccgcc gcatcaccgc cctggacgtc ctctccacgc agcgcgtcca 480 

33 ctccttctcc ggaatccgga gcgacgagac gaagcgtctg atgcagaggt tggtgctggc 540 

34 caagaactcg aacgaggaag agtttgcgcg agtggagatt agttcgatgt tcaacgactt 600 

35 aacttacaac aacataatga ggatgatatc ggggaagagg ttttacggag aggagagtga 660 

36 gatgaagaac gttgaggaag cgagggagtt cagagagact gtgacagaaa tgttggaact 720 

37 catgggcttg gctaacaagg gagatcactt gcctttcctc aggtggttcg attttcagaa 780 

38 tgtggagaag cgcttaaaga gtatcagtaa gaggtacgat tccatcttga ataagatcct 840 

39 tcatgagaac cgtgccagca atgaccgcca gaattccatg atcgatcatc tcctcaaact 900 
4 0 gcaagagacc cagcctcagt actacactga ccaaatcatc aaaggccttg ctctggccat 960 

41 gctttttggt ggaactgact catcaactgg gactttagag tggtcattat ctaatttatt 1020 

42 gaatcaccca gaggtgttga agaaggcaag agatgaattg gacactcaag tgggacaaga 1080 
4 3 ccgcttgtta aatgagtcag accttccaaa acttccatat cttaggaaga tcatccttga 1140 
44 gacacttagg ttgtaccccc cggccccaat tctaatacct catgtgtctt cagaagatat 1200 
4 5 tacaattgaa ggattcaata tcccacgaga cacaattgtg atcattaatg gttggggcat 12 60 
46 gcagagagat cctcagttgt ggaatgatgc cacatgcttt aaacctgaga ggtttgatgt 1320 
4 7 ggaaggagag gagaaaaagt tggtagcatt tggcatggga agaagggctt gcccaggaga 1380 
4 8 acccatggct atgcaaagtg tcagctttac tttgggattg ttgattcaat gttttgactg 1440 

49 gaaacgagta agtgaggaaa agcttgatat gacagagaac aattggatca ccttgtcaag 1500 

50 gttaattcca ttggaggcca tgtgcaaggc tcgcccactt gccactaaaa ttggaattta 1560 

51 attattaata gtatttttat ttggtaaact tgggtgattc agaatctaat acttataatt 1620 

52 ttagtgtgta agagtggtga tcatatatac atttcaaaat taataatctt tgtccaaaaa 1680 

53 tcatccatgg acaactatat gtcaattgac atctagagag aaatatagat ataagaatat 1740 
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54 ttatatttta ttactcttct ttatcttatg tgtcaaggcc cattgtagaa ttgggtgagc 1800 

55 attaacatat atcaatattg tataccgccc agttttctca aataaatttc tttactttc 1859 

57 <210> SEQ ID NO: 2 

58 <211> LENGTH: 499 

59 <212> TYPE: PRT > 

60 <213> ORGANISM: Glycine max 
62 <400> SEQUENCE: 2 



63 


Leu 


Leu 


Val 


Val 


IT- 1 

Val 


Ser 


Tyr 


Ala 


Val 


Leu 


Phe 


Leu 


Val 


Leu 


Phe 


Leu 


64 


1 








5 










10 










15 




66 


Gly Val 


Lys 


Phe 


Val 


Phe 


Gin 


Ser 


Arg 


Lys 


Leu 


Arg 


Asn 


He 


Pro 


Pro 


67 








20 










25 










30 






69 


Gly 


Pro 


Pro 


Pro 


Leu 


Pro 


He 


He 


Gly 


Asn 


Leu 


Asn 


Leu 


Leu 


Glu 


Gin 


l U 
















a n 
4 U 










A K 

4 3 








72 


Pro 


He 


His 


Arg 


Phe 


Phe 


Gin 


Arg 


Met 


Ser 


Lys 


Gin 


Tyr 


Gly Asn 


Val 


73 




50 










55 










60 










75 


Val 


Ser 


Leu 


Trp 


Phe 


Gly 


Ser 


Arg 


Leu 


Ala 


Val 


Val 


He 


Ser 


Ser 


Pro 


76 


65 










70 










75 










80 


78 


Thr 


Ala 


Tyr 


Gin 


Glu 


Cys 


Phe 


Thr 


Lys 


His 


Asp 


Val 


Ala 


Leu 


Ala 


Asn 


79 










85 










90 










95 




81 


Arg 


Leu 


Pro 


Ser 


Leu 


Ser 


Gly 


Lys 


Tyr 


He 


Phe 


Tyr 


Asn 


Asn 


Thr 


Thr 


82 








100 










105 










110 






84 


Val 


Gly 


Ser 


Cys 


Ser 


His 


Gly 


Glu 


His 


Trp 


Arg 


Asn 


Leu 


Arg 


Arg 


He 


85 






115 










120 










125 








87 


Thr 


Ala 


Leu 


Asp 


Val 


Leu 


Ser 


Thr 


Gin 


Arg 


Val 


His 


Ser 


Phe 


Ser 


Gly 


88 




130 










135 










140 










90 


He 


Arg 


Ser 


Asp 


Glu 


Thr 


Lys 


Arg 


Leu 


Met 


Gin 


Arg 


Leu 


Val 


Leu 


Ala 


91 


145 










150 










155 










160 


93 


Lys 


Asn 


Ser 


Asn 


Glu 


Glu 


Glu 


Phe 


Ala 


Arg 


Val 


Glu 


He 


Ser 


Ser 


Met 


94 










165 










170 










175 




96 


Phe 


Asn 


Asp 


Leu 


Thr 


Tyr 


Asn 


Asn 


He 


Met 


Arg 


Met 


He 


Ser 


Gly 


Lys 


97 








180 










185 










190 






99 


Arg 


Phe 


Tyr 


Gly Glu 


Glu 


Ser 


Glu 


Met 


Lys 


Asn 


Val 


Glu 


Glu 


Ala 


Arg 



100 195 200 205 

102 Glu Phe Arg Glu Thr Val Thr Glu Met Leu Glu Leu Met Gly Leu Ala 

103 210 215 220 

105 Asn Lys Gly Asp His Leu Pro Phe Leu Arg Trp Phe Asp Phe Gin Asn 

106 225 230 235 240 

108 Val Glu Lys Arg Leu Lys Ser He Ser Lys Arg Tyr Asp Ser He Leu 

109 245 250 255 

111 Asn Lys He Leu His Glu Asn Arg Ala Ser Asn Asp Arg Gin Asn Ser 

112 260 265 270 

114 Met He Asp His Leu Leu Lys Leu Gin Glu Thr Gin Pro Gin Tyr Tyr 

115 275 280 285 

117 Thr Asp Gin He He Lys Gly Leu Ala Leu Ala Met Leu Phe Gly Gly 

118 290 295 300 

120 Thr Asp Ser Ser Thr Gly Thr Leu Glu Trp Ser Leu Ser Asn Leu Leu 

121 305 310 315 320 

123 Asn His Pro Glu Val Leu Lys Lys Ala Arg Asp Glu Leu Asp Thr Gin 

124 325 330 335 
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126 Val Gly Gin Asp Arg Leu Leu Asn 



127 



340 



129 Tyr Leu Arg Lys lie lie Leu Glu 



Glu Ser Asp 
345 

Thr Leu Arg 



130 



355 



360 



132 Pro lie Leu lie Pro His Val Ser Ser Glu Asp 



133 



370 



37.5 



135 Phe Asn lie Pro Arg Asp Thr lie 



136 385 



390 



138 Gin Arg Asp Pro Gin Leu Trp Asn 

139 405 

141 Arg Phe Asp Val Glu Gly Glu Glu 



142 



420 



144 Gly Arg Arg Ala Cys Pro Gly Glu 



Val lie He 
395 

Asp Ala Thr 

410 
Lys Lys Leu 
425 

Pro Met Ala 



145 



435 



440 



147 Phe Thr Leu Gly Leu Leu He Gin Cys Phe Asp 



148 



450 



455 



150 Glu Glu Lys Leu Asp Met Thr Glu 

151 465 470 

153 Leu He Pro Leu Glu Ala Met Cys 

154 485 
156 He Gly He 

159 <210> SEQ ID NO: 3 

160 <211> LENGTH: 1698 

161 <212> TYPE: DNA 

162 <213> ORGANISM: Glycine max 

164 <400> SEQUENCE: 3 

165 cagtaataac aatgtctcct ttcttatctt 

166 ctctcaagta .ccttttccaa agaagcagaa 

167 ctcttcctat aatcggcaac cttaacctcg 

168 gcatgtccca aaaatatgga aacatcatat 

169 ttgtttcatc acccacagcg taccaagaat 

170 acagggtacg ctccctctcg ggaaaataca 

171 gctcccacgg cgagcactgg cgcaacctcc 

172 cgcagcgcgt ccactccttc tccggaatcc 

173 ggctggccag ggactccggg aaagattttg 

174 acttgacgta caacaacatc atgaggatga 

175 gtgaacttaa caacgttgag gaagcgaagg 

176 aactcatggg gttggctaac aagggagatc 

177 agaacgtgga gaagaggttg aagaatatca 

178 tccttgatga gaaccgtaac aacaaggacc 

179 aactgcaaga gacacagcct gactattata 

180 ctatgctctt tggtggaaca gactcgtcaa 

181 tagtgaatga cccagaggtg ctgcagaagg 

182 cagatcggct gttaaatgag tcagaccttc 

183 ttgaaacact taggttgtac cctccggctc 

184 acatcaatat cgaaggattc aatgttccac 

185 ccatgcaaag agatcctaag atatggaaag 

186 atgaagaagg agaggagaag aaattggtag 

187 gagaacccat ggctatgcaa agtgttagct 



Asn Asn Trp 
475 

Lys Ala Arg 
490 



Leu 

Leu 

He 
380 
Asn 

Cys 

Val 

Met 

Trp 
460 
He 

Pro 



Pro 

Tyr 
365 
Thr 

Gly 

Phe 

Ala 

Gin 
445 
Lys 

Thr 

Leu 



Lys Leu Pro 
350 

Pro Pro Ala 

He Glu Gly 

Trp Gly Met 
400 

Lys Pro Glu 

415 
Phe Gly Met 
430 

Ser Val Ser 

Arg Val Ser 

Leu Ser Arg 
480 

Ala Thr Lys 
495 



actctcttct 
aagtacgaaa 
ttgaacaacc 
ccctttggtt 
gtttcaccaa 
tattctacga 
gccgcataac 
ggagcgacga 
cgcgcgtgga 
tttcggggaa 
agttcagaga 
acttaccttt 
gtaagaggta 
gcgagaattc 
ccgaccaaat 
ctggaacttt 
caagagatga 
caaaacttcc 
caattctaat 
gagacacaat 
atgcgacaag 
catttggtat 
atactttggg 



ttccctcgtg 
cctgccacct 
tatacaccgt 
tgggtcacgt 
acatgatgtt 
caacaccacc 
ctctctcgac 
gacgaagagg 
gatgacctcc 
gcggttttac 
cactgtgaat 
cctaaggtgg 
tgataccatc 
catgattggt 
catcaaaggc 
agagtgggca 
gttggacgct 
ttatctcagg 
accacacgtg 
tgtgattatt 
ctttaaacct 
gggaagaagg 
attaatgatt 



ttcttcttca 
ggtccgactc 
ttcttccacc 
cttgttgtgg 
accttggcca 
gtagggtctt 
gttctatcga 
ttgatacaca 
aagtttgctg 
ggagaagaga 
gagatgctgc 
ttcgattttc 
ttgaataaga 
catctcctca 
cttgctttgg 
ttatctaatt 
caagtaggac 
aagatagttc 
gcttcagaag 
aatggttggg 
gagaggtttg 
gcttgcccag 
caatgttttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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188 actggaaacg agtaagtgag aagaagcttg atatgacaga gaataattgg atcaccttgt 1440 

189 caaggttaat tccattggag gctatgtgta aagcccgccc actcgccagc aaagttgaaa 1500 

190 gttattaaca atattttatt tggtatattt gggtgaggat ctaatactca taatttcggt 1560 

191 gtgtaagtct atgcatgtta aaattaataa tatttgtcgt atgtccacaa ggccaaatgt 1620 

192 agtactgggt gtggatttgc atatacaata tcaatattgt ataaatccca gtttccttga 1680 

193 ataaatttct ttactttc 1698 

195 <210> SEQ ID NO: 4 

196 <211> LENGTH: 494 

197 <212> TYPE: PRT 

198 <213> ORGANISM: Glycine max 

200 <400> SEQUENCE: 4 * 



201 


Leu 


Ser 


Tyr 


Ser 


Leu 


Leu 


Ser 


Leu 


Val 


Phe 


Phe 


Phe 


Thr 


Leu 


Lys 


Tyr 


202 


1 








5 










10 










15 




204 


Leu 


Phe 


Gin 


Arg 


Ser 


Arg 


Lys 


Val 


Arg 


Asn 


Leu 


Pro 


Pro 


Gly 


Pro 


Thr 


205 








20 










25 










30 






207 


Pro 


Leu 


Pro 


He 


He 


Gly 


Asn 


Leu 


Asn 


Leu 


Val 


Glu 


Gin 


Pro 


He 


His 


208 






35 










40 










45 








210 


Arg 


Phe 


Phe 


His 


Arg 


Met 


Ser 


Gin 


Lys 


Tyr 


Gly Asn 


He 


He 


Ser 


Leu 


211 




50 










55 










60 










213 


Trp 


Phe 


Gly 


Ser 


Arg 


Leu 


Val 


Val 


Val 


Val 


Ser 


Ser 


Pro 


Thr 


Ala 


Tyr 


214 


65 










70 










75 










80 


216 


Gin 


Glu 


Cys 


Phe 


Thr 


Lys 


His 


Asp 


Val 


Thr 


Leu 


Ala 


Asn 


Arg 


Val 


Arg 


217 










85 










90 










95 




219 


Ser 


Leu 


Ser 


Gly 


Lys 


Tyr 


He 


Phe 


Tyr 


Asp 


Asn 


Thr 


Thr 


Val 


Gly 


Ser 


220 








100 










105 










110 






222 


Cys 


Ser 


His 


Gly Glu 


His 


Trp 


Arg 


Asn 


Leu 


Arg 


Arg 


He 


Thr 


Ser 


Leu 


223 






115 










120 










125 








225 


Asp 


Val 


Leu 


Ser 


Thr 


Gin 


Arg 


Val 


His 


Ser 


Phe 


Ser 


Gly 


He 


Arg 


Ser 


226 




130 










135 










140 










228 


Asp 


Glu 


Thr 


Lys 


Arg 


Leu 


He 


His 


Arg 


Leu 


Ala 


Arg 


Asp 


Ser 


Gly 


Lys 


229 


145 










150 










155 










160 


231 


Asp 


Phe 


Ala 


Arg 


Val 


Glu 


Met 


Thr 


Ser 


Lys 


Phe 


Ala 


Asp 


Leu 


Thr 


Tyr 


232 










165 










170 










175 




234 


Asn 


Asn 


He 


Met 


Arg 


Met 


He 


Ser 


Gly 


Lys 


Arg 


Phe 


Tyr 


Gly 


Glu 


Glu 


235 








180 










185 










190 






237 


Ser 


Glu 


Leu 


Asn 


Asn 


Val 


Glu 


Glu 


Ala 


Lys 


Glu 


Phe 


Arg 


Asp 


Thr 


Val 


238 






195 










200 










205 








240 


Asn 


Glu 


Met 


Leu 


Gin 


Leu 


Met 


Gly 


Leu 


Ala 


Asn 


Lys 


Gly 


Asp 


His 


Leu 


241 




210 










215 










220 










243 


Pro 


Phe 


Leu 


Arg 


Trp 


Phe 


Asp 


Phe 


Gin 


Asn 


Val 


Glu 


Lys 


Arg 


Leu 


Lys 


244 


225 










230 










235 










240 


246 


Asn 


He 


Ser 


Lys 


Arg 


Tyr 


Asp 


Thr 


He 


Leu 


Asn 


Lys 


He 


Leu 


Asp 


Glu 


247 










245 










250 










255 




249 


Asn 


Arg 


Asn 


Asn 


Lys 


Asp 


Arg 


Glu 


Asn 


Ser 


Met 


He 


Gly 


His 


Leu 


Leu 


250 








260 










265 










270 






252 


Lys 


Leu 


Gin 


Glu 


Thr 


Gin 


Pro 


Asp 


Tyr 


Tyr 


Thr 


Asp 


Gin 


He 


He 


Lys 


253 






275 










280 










285 








255 


Gly 


Leu 


Ala 


Leu 


Ala 


Met 


Leu 


Phe 


Gly 


Gly 


Thr 


Asp 


Ser 


Ser 


Thr 


Gly 


256 




290 










295 










300 
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^ JO 


Thr Leu 


\j J-U. 


irp 


nld 


Leu 


Ser 


Asn 'Leu 


Val 


Asn 


Asp 


Pro 


blU 


vai 


Leu 


O R Q 


305 










J1U 




















320 


261 


Gin Lys 


Ala 

rt JLCl 






VJ J_ u. 


Leu 


Asp 


Ala 


Gin 


V a. J. 




Pro 


Asp 


Arg 


Leu 


O & 9 

z o z 










J z 0 










330 










335 




9 A 


Leu Asn 


pin 


Ser 


Asp 


Leu 


Pro 


Lys 


Leu 


Pro 


Tyr 


Leu 


Arg 


Lys 


lie 


Val 


9 ^ 
ZOO 








*5 a n 










345 










OCA 






9 fi7 

z u / 


Leu Glu 


T Vi r* 
± 111. 


Leu 


Arc[ 


Leu 


iyr 


Pro 


Pro 


Ala 


Pro 


lie 


Leu 


lie 


Pro 


His 


9 £ R 
ZOO 






occ 
JJJ 










JoU 










0 c c 

Sod 








9 7H 
Z / \J 


Val Ala 


Ser 


CjIU 


Asp 


Tl Q 

lie 


Asn 


lie 


Glu 


Gly 


Phe 


Asn 


Val 


Pro 


Arg 


Asp 


9 71 
Z / _L 


370 










•57c 

O / D 










0 0 n 
JoU 










9 7^ 

Z / J 


Thr lie 


V a. J_ 


Tin 

i le 


Tin 


Asn 




Trp 


Ala 


Met 


Cjin 


Arg 


Asp 


Pro 


Lys 


lie 


9 7/1 
Z / 4 


385 










o o n 

jy u 










*> A C 

395 










400 


z / o 


Trp Lys 


Asp 


Ala 


Thr 


Ser 


Pne 


Lys 


Pro 


Glu 


Arg 


Phe 


Asp 


Glu 


Glu 


Gly 


O 7 7 
Z / / 










405 










410 










415 




O 7 Q 

z / y 


Glu Glu 


Lys 


Lys 


Leu 


Val 


Ala 


Pne 


Gly Met 


Gly 


Arg 


Arg 


Ala 


Cys 


Pro 


Z O U 








420 










425 










430 






z oz 


Gly Glu 


Pro 


Met 


Ala 


Met 


Gin 


Ser 


Val 


Ser 


Tyr 


Thr 


Leu 


Gly 


Leu 


Met 


0 R "3 

zoo 






435 










440 










A A C 

445 








z o D 


He Gin 


Cys 


Phe 


Asp 


Trp 


Lys 


Arg 


Val 


Ser 


Glu 


Lys 


Lys 


Leu 


Asp 


Met 


ZOO 


450 










455 










460 










O Q Q 
ZOO 


Thr Glu 


Asn 


Asn 


Trp 


He 


Thr 


Leu 


Ser 


Arg 


Leu 


He 


Pro 


Leu 


Glu 


Ala 


z o y 


465 










470 










475 










480 


z y i 


Met Cys 


Lys 


Ala 


Arg 


Pro 


Leu 


Ala 


Ser 


Lys 


Val 


Glu 


Ser 


Tyr 






z y z 










485 










490 














z^ y 4 


<210> 


SEQ ID NO: 


: 5 
























z y d 


<211> 


LENGTH: 84 3 
























z y o 


<212> 


TYPE: 


DNA 


























z y / 


<213> 


ORGANISM: 


Glycine 


max 




















zy y 


<220> 


FEATURE : 


























■3 nn 


<221> 


NAME/KEY : 


unsure 






















Of|1 

JU1 


<222> 


LOCATION: 


(476) 




















J uz 


<223> 


OTHER 


INFORMATION: 


n = 


a, 


c, g 


or 


t 












JU4 


<220> 


FEATURE : 


























jUj 


<221> 


NAME/KEY : 


unsure 






















^ n £ 


<222> 


LOCATION: 


(657) 
























<223> 


OTHER 


INFORMATION: 


n = 


a, 


c, g 


or 


t 












309 


<220> 


FEATURE : 


























310 


<221> 


NAME/KEY : 


unsure 






















311 


<222> 


LOCATION: 


(703) 






















312 


<223> 


OTHER 


INFORMATION: 


n = 


a, 


c/ g 


or 


t 












314 


<220> 


FEATURE : 


























315 


<221> 


NAME/KEY : 


unsure 






















316 


<222> 


LOCATION: 


(712) 






















317 


<223> 


OTHER 


INFORMATION: 


n = 


a, 


c, g 


or 


t 












319 


<220> 


FEATURE : 


























320 


<221> 


NAME/KEY : 


unsure 






















321 


<222> 


LOCATION: 


(789) 






f 
















322 


<223> 


OTHER 


INFORMATION: 


n = 


a, 


c/ g 


or 


t 












324 


<220> 


FEATURE : 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 07/17/2003 

PATENT APPLICATION: US/09/868, 546A TIME: 15:37:50 

Input Set : A:\BB-1324 Corrected Sequence Listing.txt 
Output Set: N:\CRF4\07172003\I868546A.raw 

Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:5; N Pos . 4M , 6p , 7^)3 , 71^ 789^ 843 
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